Overview

Dataset statistics

Number of variables29
Number of observations1628
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.0 MiB
Average record size in memory647.2 B

Variable types

NUM15
CAT11
BOOL3

Reproduction

Analysis started2020-06-25 11:03:31.730680
Analysis finished2020-06-25 11:04:07.495009
Duration35.76 seconds
Versionpandas-profiling v2.7.1
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Behaviour has constant value "1" Constant
JobRole is highly correlated with DepartmentHigh correlation
Department is highly correlated with JobRoleHigh correlation
Id is uniformly distributed Uniform
Id has unique values Unique
NumCompaniesWorked has 198 (12.2%) zeros Zeros
TrainingTimesLastYear has 74 (4.5%) zeros Zeros
YearsAtCompany has 73 (4.5%) zeros Zeros
YearsInCurrentRole has 361 (22.2%) zeros Zeros
YearsSinceLastPromotion has 696 (42.8%) zeros Zeros
YearsWithCurrManager has 414 (25.4%) zeros Zeros

Variables

Id
Real number (ℝ≥0)

UNIFORM
UNIQUE
Distinct count1628
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean814.5
Minimum1
Maximum1628
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum1
5-th percentile82.35
Q1407.75
median814.5
Q31221.25
95-th percentile1546.65
Maximum1628
Range1627
Interquartile range (IQR)813.5

Descriptive statistics

Standard deviation470.1074345
Coefficient of variation (CV)0.577173032
Kurtosis-1.2
Mean814.5
Median Absolute Deviation (MAD)407
Skewness0
Sum1326006
Variance221001
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1627 1 0.1%
 
1092 1 0.1%
 
1072 1 0.1%
 
1074 1 0.1%
 
1076 1 0.1%
 
1078 1 0.1%
 
1080 1 0.1%
 
1082 1 0.1%
 
1084 1 0.1%
 
1086 1 0.1%
 
Other values (1618) 1618 99.4%
 
ValueCountFrequency (%) 
1 1 0.1%
 
2 1 0.1%
 
3 1 0.1%
 
4 1 0.1%
 
5 1 0.1%
 
ValueCountFrequency (%) 
1628 1 0.1%
 
1627 1 0.1%
 
1626 1 0.1%
 
1625 1 0.1%
 
1624 1 0.1%
 

Age
Real number (ℝ≥0)

Distinct count43
Unique (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.6455773955774
Minimum18
Maximum60
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum18
5-th percentile21
Q129
median34
Q342
95-th percentile53
Maximum60
Range42
Interquartile range (IQR)13

Descriptive statistics

Standard deviation9.481794355
Coefficient of variation (CV)0.2660019853
Kurtosis-0.4843929692
Mean35.6455774
Median Absolute Deviation (MAD)6
Skewness0.4315411626
Sum58031
Variance89.90442419
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
31 92 5.7%
 
29 85 5.2%
 
35 80 4.9%
 
34 73 4.5%
 
30 70 4.3%
 
26 68 4.2%
 
32 65 4.0%
 
33 60 3.7%
 
28 58 3.6%
 
36 55 3.4%
 
Other values (33) 922 56.6%
 
ValueCountFrequency (%) 
18 13 0.8%
 
19 23 1.4%
 
20 27 1.7%
 
21 30 1.8%
 
22 13 0.8%
 
ValueCountFrequency (%) 
60 3 0.2%
 
59 8 0.5%
 
58 15 0.9%
 
57 3 0.2%
 
56 9 0.6%
 

Attrition
Boolean

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
0
843
1
785
ValueCountFrequency (%) 
0 843 51.8%
 
1 785 48.2%
 

BusinessTravel
Categorical

Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
Travel_Rarely
1105
Travel_Frequently
403
Non-Travel
 
120
ValueCountFrequency (%) 
Travel_Rarely 1105 67.9%
 
Travel_Frequently 403 24.8%
 
Non-Travel 120 7.4%
 

Length

Max length17
Mean length13.76904177
Min length10
ValueCountFrequency (%) 
Lowercase_Letter 11 64.7%
 
Uppercase_Letter 4 23.5%
 
Connector_Punctuation 1 5.9%
 
Dash_Punctuation 1 5.9%
 
ValueCountFrequency (%) 
Latin 15 88.2%
 
Common 2 11.8%
 
ValueCountFrequency (%) 
ASCII 17 100.0%
 

Department
Categorical

HIGH CORRELATION
Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
Research & Development
979
Sales
568
Human Resources
 
81
ValueCountFrequency (%) 
Research & Development 979 60.1%
 
Sales 568 34.9%
 
Human Resources 81 5.0%
 

Length

Max length22
Mean length15.72051597
Min length5
ValueCountFrequency (%) 
Lowercase_Letter 14 70.0%
 
Uppercase_Letter 4 20.0%
 
Space_Separator 1 5.0%
 
Other_Punctuation 1 5.0%
 
ValueCountFrequency (%) 
Latin 18 90.0%
 
Common 2 10.0%
 
ValueCountFrequency (%) 
ASCII 20 100.0%
 

DistanceFromHome
Real number (ℝ≥0)

Distinct count29
Unique (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.735257985257986
Minimum1
Maximum29
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median8
Q315
95-th percentile26
Maximum29
Range28
Interquartile range (IQR)13

Descriptive statistics

Standard deviation8.306546029
Coefficient of variation (CV)0.8532435445
Kurtosis-0.4260285132
Mean9.735257985
Median Absolute Deviation (MAD)6
Skewness0.8685624445
Sum15849
Variance68.99870694
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 214 13.1%
 
2 211 13.0%
 
9 129 7.9%
 
10 90 5.5%
 
3 84 5.2%
 
7 82 5.0%
 
4 77 4.7%
 
5 76 4.7%
 
8 76 4.7%
 
6 56 3.4%
 
Other values (19) 533 32.7%
 
ValueCountFrequency (%) 
1 214 13.1%
 
2 211 13.0%
 
3 84 5.2%
 
4 77 4.7%
 
5 76 4.7%
 
ValueCountFrequency (%) 
29 41 2.5%
 
28 25 1.5%
 
27 13 0.8%
 
26 26 1.6%
 
25 33 2.0%
 

Education
Real number (ℝ≥0)

Distinct count5
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.8845208845208847
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile4
Maximum5
Range4
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.020469638
Coefficient of variation (CV)0.3537743976
Kurtosis-0.5845982845
Mean2.884520885
Median Absolute Deviation (MAD)1
Skewness-0.3164886709
Sum4696
Variance1.041358283
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3 644 39.6%
 
4 434 26.7%
 
2 306 18.8%
 
1 201 12.3%
 
5 43 2.6%
 
ValueCountFrequency (%) 
1 201 12.3%
 
2 306 18.8%
 
3 644 39.6%
 
4 434 26.7%
 
5 43 2.6%
 
ValueCountFrequency (%) 
5 43 2.6%
 
4 434 26.7%
 
3 644 39.6%
 
2 306 18.8%
 
1 201 12.3%
 

EducationField
Categorical

Distinct count6
Unique (%)0.4%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
Life Sciences
623
Medical
521
Marketing
197
Technical Degree
162
Other
 
85
ValueCountFrequency (%) 
Life Sciences 623 38.3%
 
Medical 521 32.0%
 
Marketing 197 12.1%
 
Technical Degree 162 10.0%
 
Other 85 5.2%
 
Human Resources 40 2.5%
 

Length

Max length16
Mean length10.52579853
Min length5
ValueCountFrequency (%) 
Lowercase_Letter 17 65.4%
 
Uppercase_Letter 8 30.8%
 
Space_Separator 1 3.8%
 
ValueCountFrequency (%) 
Latin 25 96.2%
 
Common 1 3.8%
 
ValueCountFrequency (%) 
ASCII 26 100.0%
 

EmployeeNumber
Real number (ℝ≥0)

Distinct count1000
Unique (%)61.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1000.9858722358722
Minimum1
Maximum2068
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum1
5-th percentile90.35
Q1509.25
median977
Q31494
95-th percentile1953.95
Maximum2068
Range2067
Interquartile range (IQR)984.75

Descriptive statistics

Standard deviation585.4176938
Coefficient of variation (CV)0.5848411152
Kurtosis-1.136645074
Mean1000.985872
Median Absolute Deviation (MAD)491
Skewness0.08723216791
Sum1629605
Variance342713.8763
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 5 0.3%
 
514 5 0.3%
 
1486 5 0.3%
 
478 5 0.3%
 
485 5 0.3%
 
1467 5 0.3%
 
1457 5 0.3%
 
488 5 0.3%
 
492 5 0.3%
 
494 5 0.3%
 
Other values (990) 1578 96.9%
 
ValueCountFrequency (%) 
1 5 0.3%
 
2 1 0.1%
 
4 5 0.3%
 
5 1 0.1%
 
8 1 0.1%
 
ValueCountFrequency (%) 
2068 1 0.1%
 
2062 1 0.1%
 
2061 1 0.1%
 
2060 1 0.1%
 
2057 1 0.1%
 
Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
3
480
4
472
1
372
2
304
ValueCountFrequency (%) 
3 480 29.5%
 
4 472 29.0%
 
1 372 22.9%
 
2 304 18.7%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

Gender
Categorical

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
Male
996
Female
632
ValueCountFrequency (%) 
Male 996 61.2%
 
Female 632 38.8%
 

Length

Max length6
Mean length4.776412776
Min length4
ValueCountFrequency (%) 
Lowercase_Letter 4 66.7%
 
Uppercase_Letter 2 33.3%
 
ValueCountFrequency (%) 
Latin 6 100.0%
 
ValueCountFrequency (%) 
ASCII 6 100.0%
 

JobInvolvement
Categorical

Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
3
925
2
447
4
 
130
1
 
126
ValueCountFrequency (%) 
3 925 56.8%
 
2 447 27.5%
 
4 130 8.0%
 
1 126 7.7%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

JobRole
Categorical

HIGH CORRELATION
Distinct count9
Unique (%)0.6%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
Sales Executive
365
Research Scientist
341
Laboratory Technician
310
Sales Representative
172
Manufacturing Director
121
Other values (4)
319
ValueCountFrequency (%) 
Sales Executive 365 22.4%
 
Research Scientist 341 20.9%
 
Laboratory Technician 310 19.0%
 
Sales Representative 172 10.6%
 
Manufacturing Director 121 7.4%
 
Healthcare Representative 110 6.8%
 
Manager 90 5.5%
 
Human Resources 72 4.4%
 
Research Director 47 2.9%
 

Length

Max length25
Mean length18.11056511
Min length7
ValueCountFrequency (%) 
Lowercase_Letter 20 69.0%
 
Uppercase_Letter 8 27.6%
 
Space_Separator 1 3.4%
 
ValueCountFrequency (%) 
Latin 28 96.6%
 
Common 1 3.4%
 
ValueCountFrequency (%) 
ASCII 29 100.0%
 

JobSatisfaction
Categorical

Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
3
529
4
434
1
356
2
309
ValueCountFrequency (%) 
3 529 32.5%
 
4 434 26.7%
 
1 356 21.9%
 
2 309 19.0%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

MaritalStatus
Categorical

Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
Married
681
Single
630
Divorced
317
ValueCountFrequency (%) 
Married 681 41.8%
 
Single 630 38.7%
 
Divorced 317 19.5%
 

Length

Max length8
Mean length6.807739558
Min length6
ValueCountFrequency (%) 
Lowercase_Letter 11 78.6%
 
Uppercase_Letter 3 21.4%
 
ValueCountFrequency (%) 
Latin 14 100.0%
 
ValueCountFrequency (%) 
ASCII 14 100.0%
 

MonthlyIncome
Real number (ℝ≥0)

Distinct count941
Unique (%)57.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5785.52457002457
Minimum1009
Maximum19999
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum1009
5-th percentile2048.9
Q12625
median4304
Q37124.25
95-th percentile16539.7
Maximum19999
Range18990
Interquartile range (IQR)4499.25

Descriptive statistics

Standard deviation4339.293147
Coefficient of variation (CV)0.7500258783
Kurtosis1.863724432
Mean5785.52457
Median Absolute Deviation (MAD)1896.5
Skewness1.561332502
Sum9418834
Variance18829465.02
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2404 11 0.7%
 
5346 10 0.6%
 
2380 7 0.4%
 
2342 7 0.4%
 
2610 7 0.4%
 
9824 6 0.4%
 
2564 6 0.4%
 
2909 6 0.4%
 
2323 6 0.4%
 
2886 6 0.4%
 
Other values (931) 1556 95.6%
 
ValueCountFrequency (%) 
1009 5 0.3%
 
1051 1 0.1%
 
1052 1 0.1%
 
1081 5 0.3%
 
1118 5 0.3%
 
ValueCountFrequency (%) 
19999 1 0.1%
 
19973 1 0.1%
 
19926 1 0.1%
 
19859 5 0.3%
 
19847 1 0.1%
 

NumCompaniesWorked
Real number (ℝ≥0)

ZEROS
Distinct count10
Unique (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.7616707616707616
Minimum0
Maximum9
Zeros198
Zeros (%)12.2%
Memory size12.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q34
95-th percentile8
Maximum9
Range9
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.54999527
Coefficient of variation (CV)0.92335238
Kurtosis-0.1540142041
Mean2.761670762
Median Absolute Deviation (MAD)1
Skewness0.9859588452
Sum4496
Variance6.502475879
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 607 37.3%
 
0 198 12.2%
 
3 154 9.5%
 
2 152 9.3%
 
4 146 9.0%
 
6 98 6.0%
 
7 90 5.5%
 
5 70 4.3%
 
9 67 4.1%
 
8 46 2.8%
 
ValueCountFrequency (%) 
0 198 12.2%
 
1 607 37.3%
 
2 152 9.3%
 
3 154 9.5%
 
4 146 9.0%
 
ValueCountFrequency (%) 
9 67 4.1%
 
8 46 2.8%
 
7 90 5.5%
 
6 98 6.0%
 
5 70 4.3%
 

OverTime
Boolean

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
No
1000
Yes
628
ValueCountFrequency (%) 
No 1000 61.4%
 
Yes 628 38.6%
 

PercentSalaryHike
Real number (ℝ≥0)

Distinct count15
Unique (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.207616707616708
Minimum11
Maximum25
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum11
5-th percentile11
Q112
median14
Q318
95-th percentile22
Maximum25
Range14
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.686703092
Coefficient of variation (CV)0.2424247772
Kurtosis-0.2771366513
Mean15.20761671
Median Absolute Deviation (MAD)2
Skewness0.8311232246
Sum24758
Variance13.59177969
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11 243 14.9%
 
12 222 13.6%
 
13 219 13.5%
 
14 199 12.2%
 
15 128 7.9%
 
16 103 6.3%
 
17 100 6.1%
 
18 93 5.7%
 
22 68 4.2%
 
19 68 4.2%
 
Other values (5) 185 11.4%
 
ValueCountFrequency (%) 
11 243 14.9%
 
12 222 13.6%
 
13 219 13.5%
 
14 199 12.2%
 
15 128 7.9%
 
ValueCountFrequency (%) 
25 18 1.1%
 
24 29 1.8%
 
23 32 2.0%
 
22 68 4.2%
 
21 56 3.4%
 
Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
3
1375
4
 
253
ValueCountFrequency (%) 
3 1375 84.5%
 
4 253 15.5%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 2 100.0%
 
ValueCountFrequency (%) 
Common 2 100.0%
 
ValueCountFrequency (%) 
ASCII 2 100.0%
 

StockOptionLevel
Categorical

Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
0
836
1
557
2
 
135
3
 
100
ValueCountFrequency (%) 
0 836 51.4%
 
1 557 34.2%
 
2 135 8.3%
 
3 100 6.1%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

TotalWorkingYears
Real number (ℝ≥0)

Distinct count39
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.949017199017199
Minimum0
Maximum38
Zeros15
Zeros (%)0.9%
Memory size12.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q15
median8
Q313
95-th percentile25
Maximum38
Range38
Interquartile range (IQR)8

Descriptive statistics

Standard deviation7.482935655
Coefficient of variation (CV)0.7521281253
Kurtosis1.147729867
Mean9.949017199
Median Absolute Deviation (MAD)4
Skewness1.169746211
Sum16197
Variance55.99432602
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
10 185 11.4%
 
1 160 9.8%
 
6 135 8.3%
 
8 125 7.7%
 
7 109 6.7%
 
5 98 6.0%
 
9 90 5.5%
 
4 79 4.9%
 
2 61 3.7%
 
3 59 3.6%
 
Other values (29) 527 32.4%
 
ValueCountFrequency (%) 
0 15 0.9%
 
1 160 9.8%
 
2 61 3.7%
 
3 59 3.6%
 
4 79 4.9%
 
ValueCountFrequency (%) 
38 1 0.1%
 
37 4 0.2%
 
36 3 0.2%
 
35 2 0.1%
 
34 9 0.6%
 

TrainingTimesLastYear
Real number (ℝ≥0)

ZEROS
Distinct count7
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.7524570024570023
Minimum0
Maximum6
Zeros74
Zeros (%)4.5%
Memory size12.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q33
95-th percentile5
Maximum6
Range6
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.288032994
Coefficient of variation (CV)0.4679575349
Kurtosis0.4875250536
Mean2.752457002
Median Absolute Deviation (MAD)1
Skewness0.4665967456
Sum4481
Variance1.659028993
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2 610 37.5%
 
3 534 32.8%
 
4 139 8.5%
 
5 133 8.2%
 
1 78 4.8%
 
0 74 4.5%
 
6 60 3.7%
 
ValueCountFrequency (%) 
0 74 4.5%
 
1 78 4.8%
 
2 610 37.5%
 
3 534 32.8%
 
4 139 8.5%
 
ValueCountFrequency (%) 
6 60 3.7%
 
5 133 8.2%
 
4 139 8.5%
 
3 534 32.8%
 
2 610 37.5%
 

YearsAtCompany
Real number (ℝ≥0)

ZEROS
Distinct count36
Unique (%)2.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.092751842751842
Minimum0
Maximum37
Zeros73
Zeros (%)4.5%
Memory size12.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median5
Q38
95-th percentile20
Maximum37
Range37
Interquartile range (IQR)6

Descriptive statistics

Standard deviation5.921167973
Coefficient of variation (CV)0.971838034
Kurtosis4.775165785
Mean6.092751843
Median Absolute Deviation (MAD)3
Skewness1.943081467
Sum9919
Variance35.06023016
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 283 17.4%
 
5 183 11.2%
 
2 176 10.8%
 
3 139 8.5%
 
4 119 7.3%
 
10 117 7.2%
 
7 88 5.4%
 
8 82 5.0%
 
9 79 4.9%
 
6 79 4.9%
 
Other values (26) 283 17.4%
 
ValueCountFrequency (%) 
0 73 4.5%
 
1 283 17.4%
 
2 176 10.8%
 
3 139 8.5%
 
4 119 7.3%
 
ValueCountFrequency (%) 
37 1 0.1%
 
36 2 0.1%
 
34 1 0.1%
 
33 9 0.6%
 
32 1 0.1%
 

YearsInCurrentRole
Real number (ℝ≥0)

ZEROS
Distinct count19
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.614864864864865
Minimum0
Maximum18
Zeros361
Zeros (%)22.2%
Memory size12.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q37
95-th percentile10
Maximum18
Range18
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.481050667
Coefficient of variation (CV)0.9629822407
Kurtosis0.9004442433
Mean3.614864865
Median Absolute Deviation (MAD)2
Skewness1.115556872
Sum5885
Variance12.11771375
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2 437 26.8%
 
0 361 22.2%
 
7 219 13.5%
 
3 143 8.8%
 
4 109 6.7%
 
1 84 5.2%
 
8 76 4.7%
 
9 57 3.5%
 
10 26 1.6%
 
6 25 1.5%
 
Other values (9) 91 5.6%
 
ValueCountFrequency (%) 
0 361 22.2%
 
1 84 5.2%
 
2 437 26.8%
 
3 143 8.8%
 
4 109 6.7%
 
ValueCountFrequency (%) 
18 2 0.1%
 
17 3 0.2%
 
16 5 0.3%
 
15 5 0.3%
 
14 9 0.6%
 

YearsSinceLastPromotion
Real number (ℝ≥0)

ZEROS
Distinct count16
Unique (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0491400491400493
Minimum0
Maximum15
Zeros696
Zeros (%)42.8%
Memory size12.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile9
Maximum15
Range15
Interquartile range (IQR)2

Descriptive statistics

Standard deviation3.138286755
Coefficient of variation (CV)1.53151404
Kurtosis4.06794973
Mean2.049140049
Median Absolute Deviation (MAD)1
Skewness2.071732248
Sum3336
Variance9.848843759
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 696 42.8%
 
1 363 22.3%
 
2 200 12.3%
 
7 98 6.0%
 
4 56 3.4%
 
3 43 2.6%
 
6 39 2.4%
 
5 32 2.0%
 
9 21 1.3%
 
11 19 1.2%
 
Other values (6) 61 3.7%
 
ValueCountFrequency (%) 
0 696 42.8%
 
1 363 22.3%
 
2 200 12.3%
 
3 43 2.6%
 
4 56 3.4%
 
ValueCountFrequency (%) 
15 13 0.8%
 
14 11 0.7%
 
13 11 0.7%
 
12 6 0.4%
 
11 19 1.2%
 

YearsWithCurrManager
Real number (ℝ≥0)

ZEROS
Distinct count18
Unique (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.5515970515970516
Minimum0
Maximum17
Zeros414
Zeros (%)25.4%
Memory size12.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q37
95-th percentile10
Maximum17
Range17
Interquartile range (IQR)7

Descriptive statistics

Standard deviation3.494368623
Coefficient of variation (CV)0.983886565
Kurtosis0.516470596
Mean3.551597052
Median Absolute Deviation (MAD)2
Skewness1.015048751
Sum5782
Variance12.21061208
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 414 25.4%
 
2 368 22.6%
 
7 221 13.6%
 
3 148 9.1%
 
8 98 6.0%
 
4 95 5.8%
 
1 94 5.8%
 
9 47 2.9%
 
5 29 1.8%
 
10 27 1.7%
 
Other values (8) 87 5.3%
 
ValueCountFrequency (%) 
0 414 25.4%
 
1 94 5.8%
 
2 368 22.6%
 
3 148 9.1%
 
4 95 5.8%
 
ValueCountFrequency (%) 
17 5 0.3%
 
16 1 0.1%
 
15 5 0.3%
 
14 13 0.8%
 
13 10 0.6%
 

CommunicationSkill
Real number (ℝ≥0)

Distinct count5
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.135749385749386
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range4
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.408770357
Coefficient of variation (CV)0.4492611442
Kurtosis-1.291879611
Mean3.135749386
Median Absolute Deviation (MAD)1
Skewness-0.1069868546
Sum5105
Variance1.984633919
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
5 375 23.0%
 
4 342 21.0%
 
2 325 20.0%
 
3 313 19.2%
 
1 273 16.8%
 
ValueCountFrequency (%) 
1 273 16.8%
 
2 325 20.0%
 
3 313 19.2%
 
4 342 21.0%
 
5 375 23.0%
 
ValueCountFrequency (%) 
5 375 23.0%
 
4 342 21.0%
 
3 313 19.2%
 
2 325 20.0%
 
1 273 16.8%
 

Behaviour
Boolean

CONSTANT
REJECTED
Distinct count1
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
1
1628
ValueCountFrequency (%) 
1 1628 100.0%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

IdAgeAttritionBusinessTravelDepartmentDistanceFromHomeEducationEducationFieldEmployeeNumberEnvironmentSatisfactionGenderJobInvolvementJobRoleJobSatisfactionMaritalStatusMonthlyIncomeNumCompaniesWorkedOverTimePercentSalaryHikePerformanceRatingStockOptionLevelTotalWorkingYearsTrainingTimesLastYearYearsAtCompanyYearsInCurrentRoleYearsSinceLastPromotionYearsWithCurrManagerCommunicationSkillBehaviour
01300Non-TravelResearch & Development23Medical5713Female3Laboratory Technician4Single25640No14301221176741
12360Travel_RarelyResearch & Development124Life Sciences16143Female3Manufacturing Director3Married46639Yes123272321121
23551Travel_RarelySales21Medical8423Male3Sales Executive4Single51604No1630123977351
34390Travel_RarelyResearch & Development241Life Sciences20141Male3Research Scientist4Single41087No1330182771741
45370Travel_RarelyResearch & Development33Other6893Male3Manufacturing Director3Married94341No15311021077811
56310Travel_RarelySales74Life Sciences9412Male2Sales Representative3Married23293No1530132775221
67321Travel_RarelyResearch & Development13Life Sciences3314Male2Laboratory Technician3Single37300Yes143042321211
78330Travel_RarelyResearch & Development44Medical15021Female2Laboratory Technician2Married38388No113085540251
89350Travel_FrequentlySales112Marketing11374Male3Sales Executive4Divorced49681No113153520241
910211Travel_RarelySales71Marketing17802Male3Sales Representative2Single26791No133013101051

Last rows

IdAgeAttritionBusinessTravelDepartmentDistanceFromHomeEducationEducationFieldEmployeeNumberEnvironmentSatisfactionGenderJobInvolvementJobRoleJobSatisfactionMaritalStatusMonthlyIncomeNumCompaniesWorkedOverTimePercentSalaryHikePerformanceRatingStockOptionLevelTotalWorkingYearsTrainingTimesLastYearYearsAtCompanyYearsInCurrentRoleYearsSinceLastPromotionYearsWithCurrManagerCommunicationSkillBehaviour
16181619291Travel_RarelySales93Marketing17522Female1Sales Representative2Single27601No133023222221
16191620261Travel_RarelySales83Technical Degree7964Male2Sales Executive1Single53266No173062431251
16201621331Travel_FrequentlyResearch & Development33Life Sciences7021Male3Research Scientist1Single33481Yes11301031089741
16211622201Travel_RarelyResearch & Development101Medical7014Male3Research Scientist3Single10091Yes113015101141
16221623491Travel_RarelySales113Marketing8403Female3Sales Executive4Married76541No183293987721
16231624421Travel_FrequentlyResearch & Development193Medical7523Male4Research Scientist3Divorced27596Yes123072222231
16241625551Travel_RarelySales21Medical8423Male3Sales Executive4Single51604No1630123977351
16251626251Travel_RarelySales92Life Sciences14391Male2Sales Representative1Married44003No123062322251
16261627291Travel_RarelyHuman Resources133Human Resources18441Male2Human Resources1Divorced23354Yes153343222051
16271628291Travel_RarelyResearch & Development181Medical3153Male2Research Scientist4Single23891Yes133043430121